No config defaults changed since last commit.
| Parameter | Value |
|---|---|
total_samples | 10000000 |
batch_size | 8 |
stage_samples_multiplier | 100000000000 |
update_interval | 250 |
window_size | 100 |
num_best_models_to_keep | 1 |
sampling_mode | Loss-weighted |
loss_weight_temperature | 0.5 |
loss_weight_refresh_interval | 50 |
stop_on_divergence | True |
divergence_gap | 0.002 |
divergence_ratio | 1.5 |
divergence_patience | 50 |
divergence_min_updates | 10 |
val_spike_threshold | 2.0 |
val_spike_window | 15 |
val_spike_frequency | 0.75 |
val_plateau_patience | 250 |
val_plateau_min_delta | 0.0001 |
custom_lr | 0.0001 |
disable_lr_scaling | True |
custom_warmup | -1 |
lr_min_ratio | 0.001 |
resume_warmup_ratio | 0.05 |
plateau_factor | 0.8 |
plateau_patience | 15 |
preserve_optimizer | False |
preserve_scheduler | True |
samples_mode | Train additional samples |
num_random_obs_to_visualize | 2 |
selected_frame_offset | 3 |
runs_per_stage | 5 |
serial_runs | True |
clean_old_checkpoints | True |
enable_baseline | False |
baseline_runs_per_stage | 1 |
run_id | shoulder_session_multiheight_encoder_decoder_variable_mask_ratio |
enable_wandb | True |
wandb_project | developmental-robot-movement |
lr_sweep.lr_min | 1e-07 |
lr_sweep.lr_max | 0.01 |
lr_sweep.phase_a_num_candidates | 5 |
lr_sweep.phase_a_seeds | 1 |
lr_sweep.phase_a_time_budget_min | 3.0 |
lr_sweep.phase_a_survivor_count | 2 |
lr_sweep.phase_b_seeds | 3 |
lr_sweep.phase_b_time_budget_min | 10.0 |
lr_sweep.ranking_metric | median_best_val |
lr_sweep.min_samples_before_timeout | 1000 |
lr_sweep.min_evals_before_stop | 5 |
lr_sweep.save_sweep_state | True |
plateau_sweep.enabled | True |
plateau_sweep.plateau_ema_alpha | 0.85 |
plateau_sweep.plateau_improvement_threshold | 0.0015 |
plateau_sweep.plateau_patience | 25 |
plateau_sweep.cooldown_updates | 5 |
plateau_sweep.max_sweeps_per_stage | 2 |
plateau_sweep.min_sweep_improvement | 0.0 |
initial_sweep_enabled | True |
stage_time_budget_min | 180 |
| Parameter | Value |
|---|---|
AUTOENCODER_LR | 0.0002 |
BATCH_SIZE | 1 |
CANVAS_HISTORY_SIZE | 3 |
DECODER_ONLY_DEPTH | 10 |
FOCAL_BETA | 5 |
FOCAL_LOSS_ALPHA | 0.1 |
FRAME_SIZE | (224, 224) |
GRADIO_UPDATE_INTERVAL | 1 |
LR_MIN_RATIO | 0.001 |
MODEL_TYPE | encoder_decoder |
PATCH_SIZE | 16 |
PERCEPTUAL_LOSS_WEIGHT | 0 |
SEPARATOR_WIDTH | 16 |
WARMUP_STEPS | 1000 |
WEIGHT_DECAY | 0.01 |
MASK_RATIO_MIN | 1 |
MASK_RATIO_MAX | 1 |
TRAIN_MASK_RATIO_MIN | 0.5 |
TRAIN_MASK_RATIO_MAX | 1.0 |
| Stage | Plateau Sweeps | Sweep Time | Training Time | Stage Total |
|---|---|---|---|---|
| Stage 1 | 7 | 01:48:08 | 00:15:55 | 02:04:03 |
| TOTAL | 7 | 01:48:08 | 00:15:55 | 02:04:03 |
Initial LR Sweep: Stage 1: selected LR 3.16e-05 in 00:14:51
LR Progression: 3.2e-05 → 3.2e-05 → 3.2e-05 → 3.2e-05 → 3.2e-05 → 3.2e-05 → 1.8e-06 → 1.8e-06
| Sweep # | Triggered At (samples) | Wall Time | Selected LR | Duration |
|---|---|---|---|---|
| 1 | 11,520 | 00:01:56 | 3.16e-05 | 00:15:11 |
| 2 | 22,528 | 00:18:57 | 3.16e-05 | 00:15:40 |
| 3 | 29,696 | 00:35:48 | 3.16e-05 | 00:15:37 |
| 4 | 44,544 | 00:53:53 | 3.16e-05 | 00:15:40 |
| 5 | 60,160 | 01:12:07 | 3.16e-05 | 00:15:18 |
| 6 | 78,592 | 01:30:28 | 1.78e-06 | 00:15:31 |
| 7 | 87,808 | 01:47:32 | 1.78e-06 | 00:15:09 |
| Stage | Best Loss | Stop Reason | Samples Trained | Time | Sweeps | LR (Initial→Final) |
|---|---|---|---|---|---|---|
| Stage 1 | 0.036345 | max_sweeps (2) | 7,680 | 02:04:03 | 7 | 3.2e-05→1.8e-06 |
Total Plateau Sweeps: 7
| Run | Best Loss | Stop Reason | Samples | Time | Selected |
|---|---|---|---|---|---|
| 1 | 0.042132 | max_sweeps (2) | 14,848 | 01:44:48 | |
| 2 | 0.118392 | max_sweeps (2) | 7,936 | 00:35:28 | |
| 3 | 0.115258 | max_sweeps (2) | 9,984 | 00:36:19 | |
| 4 | 0.044837 | max_sweeps (2) | 11,520 | 01:44:39 | |
| 5 | 0.036345 | max_sweeps (2) | 7,680 | 02:04:03 | ✓ |
| Mean: 0.071393 ± 0.037210 | Min: 0.036345 / Max: 0.118392 | Range: 0.082047 | |||
| Stage | Orig Loss | Train Loss | Time | Samples | Stop Reason |
|---|---|---|---|---|---|
| 1 ⭐ | 0.044051 | 0.036345 | 02:04:03 | 7680 | max_sweeps (2) |